On Using a Benchmark to Evaluate C++ Extractors

نویسندگان

  • Susan Elliott Sim
  • Richard C. Holt
  • Steve M. Easterbrook
چکیده

In this paper, we take the concept of benchmarking as used extensively in computing and apply it to evaluating C++ fact extractors. We demonstrated the efficacy of this approach by developing a prototype benchmark, CppETS 1.0 (C++ Extractor Test Suite, pronounced see-pets) and collecting feedback in a workshop setting. The CppETS benchmark characterises C++ extractors along two dimensions: Accuracy and Robustness. It consists of a series of test buckets that contain small C++ programs and related questions that pose different challenges to the extractors. As with other research areas, benchmarks are best developed through technical work and consultation with a community, so we invited researchers to apply CppETS to their extractors and report on their results in a workshop. Four teams participated in this effort, evaluating Ccia, cppx, the Rigi C++ parser, and TkSee/SN. They found that CppETS gave results that were consistent with their experience with these tools and therefore had good external validity. Workshop participants agreed that CppETS was an important contribution to fact extractor development and testing. Further efforts to make CppETS a widely-accepted benchmark will involve technical improvements and collaboration with the broader community.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the development of a sliding mode observer-based fault diagnosis scheme for a wind turbine benchmark model

This paper addresses the design of an observer-based fault diagnosis scheme, which is applied to some of the sensors and actuators of a wind turbine benchmark model. The methodology is based on a modified sliding mode observer (SMO) that allows accurate reconstruction of multiple sensor or actuator faults occurring simultaneously. The faults are reconstructed using the equivalent output err...

متن کامل

On the development of a sliding mode observer-based fault diagnosis scheme for a wind turbine benchmark model

This paper addresses the design of an observer-based fault diagnosis scheme, which is applied to some of the sensors and actuators of a wind turbine benchmark model. The methodology is based on a modified sliding mode observer (SMO) that allows accurate reconstruction of multiple sensor or actuator faults occurring simultaneously. The faults are reconstructed using the equivalent output err...

متن کامل

Multi-column deep neural network for traffic sign classification

We describe the approach that won the final phase of the German traffic sign recognition benchmark. Our method is the only one that achieved a better-than-human recognition rate of 99.46%. We use a fast, fully parameterizable GPU implementation of a Deep Neural Network (DNN) that does not require careful design of pre-wired feature extractors, which are rather learned in a supervised way. Combi...

متن کامل

Learning audio and image representations with bio-inspired trainable feature extractors

Recent advancements in pattern recognition and signal processing concern the automatic learning of data representations from labeled training samples. Typical approaches are based on deep learning and convolutional neural networks, which require large amount of labeled training samples. In this work, we propose novel feature extractors that can be used to learn the representation of single prot...

متن کامل

An Image-Based Approach to Visual Feature Space Analysis

Methods for management and analysis of non-standard data often rely on the so-called feature vector approach. The technique describes complex data instances by vectors of characteristic numeric values which allow to index the data and to calculate similarity scores between the data elements. Thereby, feature vectors often are a key ingredient to intelligent data analysis algorithms including in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002